Control Capacity of Partially Observable Dynamic Systems in Continuous Time
نویسندگان
چکیده
Stochastic dynamic control systems relate in a probabilistic fashion the space of control signals to the space of corresponding future states. Consequently, stochastic dynamic systems can be interpreted as an information channel between the control space and the state space. In this work we study this control-to-state informartion capacity of stochastic dynamic systems in continuous-time, when the states are observed only partially. The control-to-state capacity, known as empowerment, was shown in the past [1]–[6] to be useful in solving various Artificial Intelligence & Control benchmarks, and was used to replace problem-specific utilities. The higher the value of empowerment is, the more optional future states an agent may reach by using its controls inside a given time horizon. The contribution of this work is that we derive an efficient solution for computing the control-to-state information capacity for a linear, partially-observed Gaussian dynamic control system in continuous time, and discover new relationships between control-theoretic and information-theoretic properties of dynamic systems. Particularly, using the derived method, we demonstrate that the capacity between the control signal and the system output does not grow without limits with the length of the control signal. This means that only the near-past window of the control signal contributes effectively to the control-to-state capacity, while most of the information beyond this window is irrelevant for the future state of the dynamic system. We show that empowerment depends on a time constant of a dynamic system.
منابع مشابه
A Partially Observable Markovian Maintenance Process with Continuous Cost Functions
In this paper a two-state Markovian maintenance process where the true state is unknown will be considered. The operating cost per period is a continuous random variable which depends on the state of the process. If investigation cost is incurred at the beginning of any period, the system wit I be returned to the "in-control" state instantaneously. This problem is solved using the average crite...
متن کاملPartially Observable Nonlinear Risk-Sensitive Control Problems: Dynamic Programming and Veri cation Theorems
In this paper we consider continuous-time partially observable optimal control problems with exponential-of-integral cost criteria. We derive a rigorous veriication theorem when the state and control enter nonlinear in the dynamics. In addition, we show that the quadratic sensor problem is estimation solvable with respect to a certain cost criterion. The framework relies on dynamic programming ...
متن کاملA POMDP Framework to Find Optimal Inspection and Maintenance Policies via Availability and Profit Maximization for Manufacturing Systems
Maintenance can be the factor of either increasing or decreasing system's availability, so it is valuable work to evaluate a maintenance policy from cost and availability point of view, simultaneously and according to decision maker's priorities. This study proposes a Partially Observable Markov Decision Process (POMDP) framework for a partially observable and stochastically deteriorating syste...
متن کاملOptimal Control of Partially Observable Linear Quadratic Systems with Asymmetric Observation Errors
This paper deals with the optimal quadratic control problem for non-Gaussian discrete-time stochastic systems. Our main result gives explicit solutions for the optimal quadratic control problem for partially observable dynamic linear systems with asymmetric observation errors. For this purpose an asymmetric version of the Kalman filter based on asymmetric least squares estimation is used. We il...
متن کاملIncremental Sampling based Algorithms for State Estimation
Perception is a crucial aspect of the operation of autonomous vehicles. With a multitude of different sources of sensor data, it becomes important to have algorithms which can process the available information quickly and provide a timely solution. Also, an inherently continuous world is sensed by robot sensors and converted into discrete packets of information. Algorithms that can take advanta...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1701.04984 شماره
صفحات -
تاریخ انتشار 2017